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An Asymptotically Efficient Backlog Estimate 
for Dynamic Frame Aloha 

Luca B arietta, Flaminio Borgonovo, and Matteo Cesana 

Abstract 

In this paper we investigate backlog estimation procedures for Dynamic Frame Aloha (DFA) in 
Radio Frequency Identification (RFID) environment. In particular, we address the tag identification 
efficiency with any tag number N, including N — > oo. Although in the latter case efficiency e _1 is 
possible, none of the solution proposed in the literature has been shown to reach such value. We 
analyze Schoute's backlog estimate, which is very attractive for its simplicity, and formally show 
that its asymptotic efficiency is 0.311. Leveraging the analysis, we propose the Asymptotic Efficient 
backlog Estimate (AE 2 ) an improvement of the Schoute's backlog estimate, whose efficiency reaches 
e _1 asymptotically. We further show that AE 2 can be optimized in order to present an efficiency very 
close to e _1 for practically any value of the population size. We also evaluate the loss of efficiency 
when the frame size is constrained to be a power of two, as required by RFID standards for DFA, and 
theoretically show that the asymptotic efficiency becomes 0.356. 

Index Terms 

RFID, Collision Resolution, Anti-collision, Frame Aloha, Tag Identification, Tag Estimate. 

I. INTRODUCTION 

Dynamic Frame Aloha (DFA) is a multiple access protocol proposed in the field of satellite 
communications by Schoute [1]. This protocol has been rediscovered about a decade ago for 
Radio Frequency Identification (RFID), an automatic identification system in which a reader 
interrogates a set of tags in order to identify each one of them [2]. DFA and its modified 
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versions have then become very popular, as demonstrated by the large body of literature on the 
topic, being also adopted in reference standards for ultra-high frequency RFID systems 0, Hi. 

In brief, Frame Aloha (FA) operates as follows: tags reply to a reader interrogation on a slotted 
time axis where slots are grouped into frames; a tag is allowed to transmit only one packet per 
frame in a randomly chosen slot. In the first frame all tags transmit, but only a part of them 
avoid collisions with other transmissions and get through. The remaining number of tags n, often 
referred to as the backlog, re-transmit in the following frames until all of them succeed. 

Like many protocols of the Aloha family [5] also FA is intrinsically unstable and its throughput 
is very small unless stabilizing techniques are used. This is possible if a feedback on the 
outcome of previous channel slots is available, i.e., if the reader knows whether slots have 
been successfully used, not used, or collided, the latter meaning that two or more transmission 
attempts have occurred. Channel feedback is then used to derive a backlog estimate n and to 
set the frame length r accordingly. 

Assuming as performance figure the efficiency, defined as Tj(N) = N/L(N), where N the 
number of tags and L(N) the average number of slots needed to identify all tags, optimal 
conditions arise when n — n, i.e., the estimate is perfect, in which case the maximum efficiency 
is attained by setting r = n , reaching e -1 as N approaches infinity. For this reason, many 
proposals, as discussed in the next section, set the frame length r — h, which surely provides 
optimal frame efficiency when n — > oo. 

Since in RFID applications the population size N is an unknown constant, the initial estimate 
and frame size can not be optimally set, and some frames are initially spent inefficiently waiting 
for the estimate n to converge. The inefficiency of the initial phase constitutes an overhead 
that impairs the performance with respect to the optimal case, the impairment depending on the 
setting of the initial frame size and the value of N. This behavior is common to all proposals 
appeared in the literature and, as discussed in the next section, none of those protocols can reach 
efficiency e _1 asymptotically. 

In this paper, we deal with the theoretic issue represented by the asymptotic efficiency when 
the initial frame length r is finite. Although many papers have appeared proposing variations 
of the basic DFA scheme, here we refer to DFA versions that are strictly compatible with the 
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ISO and EPC standards J5J, flU, not considering proposals that introduce changes in the way 
the mechanism operates, sometimes borrowing mechanism from other protocol families such as 
Tree protocols. The essential literature, comprising also proposals of the latter type, is given for 
example in Q, II. 

We focus on the efficiency of the identification procedure. We are aware that this figure can not 
consider all the details of practical implementations, i.e., the overhead of different commands or 
the fact that the slot length may change from empty to non-empty slots; nevertheless, this figure 
is commonly used in the literature to compare different protocols since it captures the essence 
of the mechanisms, that is, the ability of the backlog estimate of minimizing the identification 
period. 

The main contribution of this paper is twofold. First we present an asymptotic analysis of 
Schoute's estimate. This estimate is attractive for its simplicity, but has been often considered 
inefficient and never analyzed in detail to show its potentials. Here we show that its asymptotic 
efficiency, when the initial frame length is set to any arbitrary value, is 0.311. We then extend 
Schoute's estimate and propose the Asymptotic Efficient backlog Estimate (AE 2 ), an estimate 
that, using the ability to restart the frame at any time even though the preceding frame is 
not finished (the Frame Restart capability of the standard), is analytically shown to reach the 
asymptotic benchmark e _1 . We finally address two practical issues; namely, we show how to 
tune AE 2 in such a way that it can guarantee efficiency close to e _1 for any value of N, and then 
derive the efficiency when the frame size is constrained to be a power of two, as required by 
RFID standards for DFA. In the latter case we theoretically show that the asymptotic efficiency 
becomes 0.356. 

The paper is organized as follows. In Sec. HH we discuss the performance of the most relevant 
backlog estimates appeared in literature and state the problem. In Sec. Unl we provide a novel 
analysis of the original DFA that sheds light on its asymptotic behavior. In Sec. [TV] we introduce 
the new estimation mechanism and evaluate its asymptotic efficiency. In Sec. |V] we address 
practical issues, namely the tuning of the mechanism for best performance with finite N, and the 
evaluation of the impairment due the constraints introduced by RFID standards. Our concluding 
remarks are reported in Sec. [VT] 
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II. Background and Previous work 

In his seminal work, Schoute proposes to estimate the backlog n by counting the number 
of collided slots c at the end of the previous frame. Assuming that the procedure is able to 
keep a frame size equal to the backlog n, the number of terminals transmitting in a slot can be 
approximated by a Poisson distribution of average 1, such that the average number of terminals 
in a collided slot is H — (1 — e~ l )/{\ — 2e _1 ) ~ 2.39 [0Q. The backlog estimate is consequently 
n = round(ifc), where round(x) is the closest integer to x. Moreover, Schoute shows that the 
average number of slots L(N, r) to solve N collisions starting with a frame of length r is given 
by: 

L{N,r) =r + J2 Ps,c{N,r)L{N - s,r), N>2, (1) 

s=0 c=0 

where p SjC (N, r) is the joint probability of having s successes and c collisions, m = min{iV — 2, r — 1}, 
and f, equal to the estimate of the backlog N — s, is function of N, r, s, c. The set of equations 
(OQ) can be solved starting from L(0,r) = L(l,r) = r. 

In the past decade, many papers have faced the problem to adapt DFA to the RFID environ- 
ment. Most of them deal with two fundamental problems: backlog estimation and frame size 
determination. Regarding the latter problem, equation (Q3, when numerically evaluated, shows 
that it attains the minimum for r = N. In fact, this optimality has been recently proven in |Q, 
where it is also shown that the asymptotic efficiency is e _1 . When n is unknown and there is 
no constraint on the frame length, Schoute and subsequent authors have often assumed r — n. 
Recently it has been shown that setting r = n is not always optimal unless h = n. Here, we 
focus on the problem of backlog estimation and review some of the backlog estimates proposed 
in literature, especially referring to asymptotic conditions where good estimates converge to the 
true value n. 

Vogt ifTOl introduces two estimation algorithms. One is the lower bound estimation n = 2c, 
the other is the minimum distance vector based on Chebyshev's inequality theory. The proposed 
schemes are devised for a limited set of frame lengths and population size. 

In IfTTTl . the authors consider two estimates, the collision ratio c/r and n = 2.39c, the same as 
Shoute's except for dropping the closest integer operator. Results on optimal frame length and 
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collision probabilities are re-derived. The average time identification periods of both proposals 
up to 900 tags appear practically equal. 

In lfT2l . the value n = n that maximizes the a posteriori probability Pr(n|s,c, e), having 
observed s successes, c collisions and e empty slots, is assumed as estimate. In practice, this 
proposal uses a Maximum Likelihood (ML) method since no a priori distribution of N is given. 
Pr(n|s,c, e) is obtained assuming independence among slots' outcomes. This estimate is shown 
to provide better performance than the previous ones, yielding an efficiency i] = 0.357 for 
N = 250, which drops to rj = 0.277 for N = 50. 

The drawback of this approach is its sensitivity to the initial frame size r , which may severely 
impair the performance when the initial tag population is not known, as in most RFID application 
scenarios. In fact, it is known that the ML does not work properly when the frame is completely 
filled by collisions; in this case, the ML mechanism tends to overestimate the backlog, and the 
probability Pr(n|s,c, e) is maximized by n = oo. Even if a bound iV max is adopted for the 
population size, if r is small and total collision arises, the overestimation causes inefficiency; 
on the other side, if r is large, inefficiency arises when N is small. 

This problem is common to other proposals that are analyzed, for example in 0, which 
reports the evaluation of seven estimation mechanisms, including the aforementioned ones. All 
of them, except h = 2.39c, largely overestimate (up to ten times and more) the backlog when 
the frame is filled by collisions. The same paper also compares the efficiency of such protocols, 
none of which is shown to reach the benchmark e -1 for large N. For example, the maximum N 
evaluated is N = 1000, where the best mechanism considered presents an average identification 
period whose length is slightly below 300, yielding an efficiency of about 0.33. 

A different class of estimates is given by the Bayesian estimate in [13] that evaluates the a 
posteriori probability distribution of the original population size N, conditioned to all the past 
observations, starting from the a priori distribution of the number of transmitting tags. Here the 
problem is that such a priori distribution is not known, and can not be hypothesized with an 
unbounded N. 

The above discussion shows the inadequacy of the proposals when the population size does 
not have a known distribution or the population size is large. However, to better sustain this 
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inadequacy, we have re-evaluated the performance of the two most relevant schemes, namely 
the Schoute's proposal and the Bayesian one. 

In Fig. Q] the efficiency for different values of iV with initial frame length r = N, r = 1, 
r = 10 and r = 100 is shown. Values up to N = 30 have been evaluated using (OQ), whereas 
values for N = 500 and N = 1000 have been obtained by simulating the algorithm. To allow 
comparisons we have also reported the performance with a perfect estimate n = n (dashed 
line), that represents a benchmark for all estimation mechanisms. We have also reported the 
case where only the estimate of the first frame is perfect, i.e. when the first frame length is set 
to N. The comparison of the latter cases shows that Shoute's mechanism is able to track the 
backlog quite well if compared to the perfect estimate case, asymptotically reaching the best 
possible efficiency e~ l . In all the other cases, Schoute's estimates suffer the mismatch between 
N and the initial frame length r, and the efficiency degrades monotonically when N increases 
beyond r, indicating the existence of a possible asymptote well below e -1 . The reasons for such 
behavior are analyzed next. 

Figure |2] shows the efficiency of the Bayesian method when the population size distribution is 
Poisson with different averages. We see that the efficiency is optimal only when N lies around 
the average. In all cases, when N increases beyond some maximum value the efficiency drops 
to zero, meaning that the estimate converges to a constant value well below the actual value, so 
that successes are very rare. In Fig. [3] the performance with a uniform population size shows a 
similar behavior. 

None of the two methods just shown seems to behave well asymptotically; however, the 
Schoute's method is by far the simplest of the two and presents characteristics, namely the 
ability to remain locked to the true value, that suggest that it can be improved to reach maximum 
asymptotic efficiency. To this purpose, we present in the next section an asymptotic analysis that 
suggests how to reach the target. 

III. Asymptotic analysis of Schoute's method 

In the following analysis we assume that N is very large, since we are interested in investi- 
gating the efficiency for N — > oo. To facilitate the reader, we proceed in steps. In the remainder 
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of the paper lower case letters represent random variables, whereas calligraphic upper cases 
represent averages. 

Step 1. Here we derive recursive formulas for the backlog. We initially assume that the 
frame size r i5 and the backlog are so large that the number of transmissions in a slot can be 
approximated by a Poisson distribution with average nj/Vj. This allows to evaluate the probability 
of an empty, successful and collided slot as 

Pe = e -m/n. ps = m e -m/ ri . Pc = l- V± e -ni/n - e- n ' /r \ 

We note that relations above also hold when starting with small r, because in this case, being 
N — i always very large, every slot is collided with probability one. In Appendix A we show 
that, in the conditions assumed, the ratio ki = Ui/ri can be safely replaced by the ratio of 
the respective averages /Q = Mi/lZi, which is the traffic per slot. With this substitution the 
probabilities above are denoted by V e ,V s ,V c . This means that the average number of collisions 
and the average backlog size can be expressed as 

Ci = KiV c , M l+l =Mi{l-V s ). (2) 

The frame length evolves with law r i+1 = round(ifq), so that 

Ki+i = E {round(Fci) } , (3) 

where E{-} is the expectation operator. Equations © and © form a recursion that provides 
sequences {IZi} and {Mi} that determine the efficiency. Unfortunately the rounding operation 
in © makes their analysis practically unfeasible. 

Step 2. When q is large, exploiting the limit lim^oo round (x)/x = 1, we can replace 
round(ifcj) in © with Hci, obtaining 

Ki+x = E{round(Fc 4 )} ~ HE{ Cl } = HC,. (4) 

If we use © in the recursion in place of © we get sequences 

R l+l = HRi (1 - K ie - Kt - e~ Ki ) , (5) 

N i+1 = Ni(l- e~ Ki ) , (6) 



1 1 - e~ Ki 

K i+ x = Ki — - T ^ _ K - Z j^, (7) 



H 1 - Kie~ K * - e" 



that correspond, respectively, to sequences {R4}, {Ni}, and {Ki}. Later on we prove that 
replacing {Hi}, {Mi}, and {/Q} with the above sequences has no effect on the evaluation 
of the asymptotic performance. Also we prove that this holds even for finite values of the initial 
frame size r . In practice, we find that sequence {Ri} approximates fairly well sequence {Ki}, 
even for moderate values of N, and this allows recurrence © to be used in practice to evaluate 
the performance. 

As an example, Fig. @] shows sequence {Ni} derived by averaging 10 4 simulation samples 
in the case N = 10 3 and r = r = 1. We can clearly see a first phase where the estimate 
increases in order to converge to the true value N = 10 3 . In a second phase, optimal conditions 
are met, collisions are solved and the backlog decreases steadily to reach zero at about the 25-th 
iteration. We explicitly note that in the descending phase the rate of descent is e _1 , showing that 
Schoute's algorithm is capable to correctly track the backlog and to solve contentions in the most 
efficient way. Figure |4] also shows the relative error sequence {(Mi — iVj) /Ni} multiplied by 10 3 
(dash-dotted line). The error is always very small except at the end of the process, where Ni 
becomes small and ignoring the rounding effect is no longer appropriate. However, this error has 
no effect on the efficiency since it occurs for a small period of time, negligible when compared 
to the entire collision resolution length. 

Step 3. The evolution of the entire process is represented by recurrence © that depicts the 
evolution of average traffic K^ This can be represented by the dashed trajectory in Fig. [5] This 
figure also shows that the evolution of the process is asymptotically stable since recurrence © 
leads to the fixed point in Ki = 1. This point is also a point of optimality because in here we 
attain the optimal condition r, = rii that provides maximum throughput. 

When the starting point in © is K = K = 1, the collision resolution process proceeds with 
a correct backlog estimate, yielding Ki = 1, for all subsequent i and we have 

R l+1 = (1 - e- l )R tl 1 > 0. (8) 

The solution of the recurrence © is Ri = (1 — e _1 )W, for i > 0. The total number of 
slot in this resolution phase is L(N) = YliLo-R-i = yielding an asymptotic throughput 
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N/L(N) = e _1 . When K = N/r > 1, the length of the entire procedure can be evaluated as 
L(K) = ^2°Z Ri = r ^So a *' w * m -^o — r > where sequence {a { = Ri/r} is always the same, 
for a given K, whichever r is. Therefore, the efficiency is evaluated as N/L = Kj YliLo a « an( ^ 
only depends on K = N/r. 

Step 4. Here we show that for large values of initial traffic K the dependence of the efficiency 
on K is negligible. Since the protocol always starts with a finite r, large N means large K, so 
we attain practically the same efficiency whichever the initial frame length r is. 

As an example, in Fig. [6] we have reported the efficiency N/L(N), evaluated through © and 
CO), for different values of traffic K. Starting from K = 1 the efficiency at first decreases as 
K increases until about K = 500 where it begins to oscillate without reaching an asymptote, 
around a mean value of 0.31125, with a period that increases geometrically with H. 

To attain some insight on the asymptotic behavior, during the solving process we consider 
three phases. The first phase, the approaching phase, starts at frame with infinite traffic and 
ends at frame u where u is chosen in such a way that the traffic K u is finite and practically no 
successes occur up to frame u; as an example we may arbitrarily assume u such as K u > 10. 
Although in this way K u and u appear arbitrarily defined, we show below that this has no effect 
on the evaluation of the efficiency, as, in fact, the initial traffic K has no effect. The assumed 
definition for u assures that u — > oo as N — > oo and R u = N/K u . 

The second phase, the convergence phase, starts at frame u + 1 and ends at frame u + v such 
that K u+V ~ 1. At this point the third phase, the tracking phase, begins where tags are solved 
with efficiency e -1 . Denoting by L', L" , and V" the length of the three phases respectively, the 
efficiency is evaluated as N/L(N) = N/(V + L" + V"). 

With high values of K = N/r, in the first phase the frame length increases deterministically 
with law Ri = rH\ for i > 0. The average number of slots up to frame u where the first phase 
ends is 




i=0 



Replacing R u = N/K u , the average length of the first phase becomes 



V 



H N 



NA(K U ). 



H -IK, 



u 
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where A(K U ) is the proportionality constant where we have made explicit the dependence on 
K u . The second phase starts at frame u + 1, when K u is such that the collision probability is 
practically one, and ends at frame u + v when K u+V ~ 1. Equation © can be used to evaluate 
the length of phase two by the following sum over a finite number of terms: 

V V 

3=1 3=1 

where terms ctj are all finite and, again, where B(K U ) is the proportionality constant where we 
have made explicit the dependance on K u . The average backlog size at the end of the second 
phase can be evaluated by © as 

V 

N" = N u+V = N u 11 (1 - e~ K ^) = NC, 
i=i 

where we have exploited the fact that N u = N. Coefficient C does not depend on K u , since in 
frame u + 1 we still have all collisions (e~ Ku+1 ~ 0). 

The third phase presents efficiency e _1 and its average length is V" = N"e = NCe. The 
efficiency with very large iV is then 

N N 1 
= = • (9) 

L(N) L> + L" + N"e A + B + Ce K 

We note that © does not depend on the choice of v, once the condition K u+V ~ 1 is assured. 
If we replace v by v + 1, coefficient A is not affected, and also term B + Ce is not affected. In 
fact, B is augmented by the term R u+v+ i which, by © with K u+v+ i ~ 1, is equal to 



it! u+ „ +1 = iV u+ „(l-e- 1 ). (10) 

On the other side, term Ce is diminished by (N u+V — N u+v+ i)e = N u+V (l — e _1 ), that is equal 
to term (flOl) . Nevertheless, efficiency © does depend on the choice of K u , through coefficients 
A and B. However, if we replace K u , chosen as suggested above, with K u ■ H, efficiency © 
does not change because this only implies the shifting of term R u from term A to term B. 
Therefore, the efficiency is periodic in a logarithmic scale and all the asymptotic amplitudes 
of the oscillations in Fig. [6] can be obtained by replacing K u with any value K' in the range 
(K U ,HK U ). 
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Table U shows the efficiency attained by © for different values of K u chosen in the range 
(20, 20H). As we can see, the values fit very well to those shown in Fig. [6] and, for all practical 
purposes, the asymptotic efficiency can be assumed equal to 0.311. 

Step 5. Now we show that replacing L', L", and L'", in the limit r — > oo, with £', £", and 
£"', in which the rounding operation is taken into account, does not change the results provided 
that the initial frame length is still r. In Appendix B we show that 

We also have lim^oo £" '(N) / L" '(N) = 1, because the second phase is composed of a finite 
number v of frames, each of them so large that the rounding effect is negligible. What shown 
also implies that at the end of the second phase we have lim^oo A/i/iV, = 1, and, therefore, 
since those tags are solved with efficiency e -1 , also for the length of the third phase we have 
lim r ^ QO £"'(N)/L"'(N) = l. 

Step 6. If r is small and © can not be assumed, the first phase is split into two sub-phases 
in which the second sub-phase starts at frame-index x such that, from this frame onward, the 
rounding operation in © can be disregarded. Index x is finite and the length of the first sub- 
phase does not depend on N, whereas the length of the second sub-phase and of the other phases 
is proportional to N. Therefore, as N — > oo, the length of the first sub-phase vanishes and the 
asymptotic efficiency remains approximately 0.311 even with small r. 

IV. An Asymptotically Efficient Estimation Procedure 

The analysis of Schoute's estimate carried out in the previous section has shown that the 
reduction of the asymptotic efficiency with respect to the theoretical value e _1 , when starting 
with a finite estimate, is not due to an intrinsic inefficiency of the estimate, but rather to the 
phase in which traffic K converges to 1, that is the convergence phase composed of V and L", 
whose length increases linearly with N. In particular, the length of this phase increases linearly 
with N because the frame length increases exponentially as H\ and this, from the overhead 
point of view, is a complete waste of time, since in this phase almost no success occurs. On 
the other side, the frame increase is needed to reduce the traffic per slot and get locked to the 
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optimal point K = 1. To get a good estimate of traffic K, we need not to explore the entire 
frame or, in another view, we need not to let all tags transmit in the frame; therefore, during 
the approaching phase toward K = 1 the frame can be shorter and provide a convergence phase 
with an average length L' + L" such that 

lim — - — = 0. (11) 

iV^oo N 

A way to reduce the number of tags transmitting in the frame is to specify at the beginning 
of the frame the transmission probability, together with the frame length. An alternative way, 
that is entirely compatible with the EPC standard, is to let all tags chose a slot in the frame, 
as in normal operation, but re-starting a new frame before the exploration of the entire frame is 
completed. Therefore, if we call virtual frame, of length z, the frame in which all tags select a 
slot for transmission, and real frame, of length r < z, the frame that has been explored when 
the frame is re-started, the traffic per slot n/z is determined by the virtual frame, but only the 
real frame is observed and used to determine n. 

The estimation procedure we present in this section, the AE 2 , adopts a virtual frame whose 
length Zi is set equal to the backlog estimate hi, as it happens in the Schoute's mechanism. 
Furthermore, the real frame length is set so that, in the convergence phase, it increases with 
index i far less than the virtual frame length. As in DFA, backlogged terminals choose a slot 
in the virtual frame length Zi and transmit in it only if the chosen slot belongs also to the real 
frame, i.e., if the Frame Restart command has not arrived yet. By setting a suitable law for r», 
the length of the first two phases can be easily forced to obey (flTT) . 

The backlog estimate is updated as follows 

z i+ i = h i+1 = round ( if*— c* J , Q > 0, 

V n J (12 ) 

Update (PT2l) is a variation of Schoute's algorithm. It uses the number of collided slots q in the real 
frame to get an estimate, (zj/r^q, of the number of collided slots in the virtual frame, multiplied 
by the factor H^. When q = 0, however, no estimate can be inferred by the observation, and, 
therefore, the estimate is assumed identical to the one in the previous frame diminished by the 
observed number of successes Sj. In Schoute's work, where the algorithm is supposed to operate 
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in K = 1, we have Hi = H, for all i; in our case, however, such a setting cannot guarantee the 
convergence to K = 1. This issue is discussed in Sec. IIV-AI 

As for the length of the real frame r», we asymptotically use the law 

n = min{round((i + l) 6 ), Zj}, (13) 

with b > 0. In (fT3l . with large N, with the exception of the first few slots, the real frame size 
increases, at first, as (i+ l) b , and later, when Zi stabilizes and % is such that round((i + l) b ) > Z{, 
the real frame coincides with the virtual one and the procedure becomes the classic DFA. 

The proposed procedure resembles in some aspects the one in [fl4l . where the traffic is 
decoupled from the frame length by the introduction of a frame transmission probability less 
than one. However, unlike our proposal where estimation takes place within the identification 
phase, in lfT4l the identification phase is preceded by an estimation phase, introduced ad hoc, 
whose length increases as log N and depends on the length of the estimate confidence interval, 
that must be made quite small, as no estimation is operated during the identification phase. The 
latter characteristic can jeopardize the procedure since no certainty exists to identify all tags. 

A. Asymptotic Analysis of AE 2 

The analysis here presented is much the same as the one presented in Sec. |inj Therefore we 
limit our explanation to parts that differ. Adopting the same assumptions used in Sec. HH] we can 
write the recursions corresponding to ©-© as 

Z l+l = Zi Hi (1 - K ie - K > - e- K >) (14) 

1 _ ?± e ~Ki 

K ^ = ^ — — \ ^. (15) 

Hil — Kit K * — e K * 

The key recursion (TT3T > is different from © since now it also depends on Zi which complicates 
the matter. Since, for an efficient estimation we want Ki to converge to 1, sequence {Hi} must 
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Recursion (fT5l) is stable because it presents a unique fixed point in K = 1 and we have 

d f 1-2C- 1 l-.Be-* 
_1< a^\ l-fie- 1 1 - Ife"* - e~* 

for all i? G (0, 1]. Although values (PT6l) could be evaluated a priori, in practice we can assume 

1 - 

Hi = 

1 - 2c" 1 

Figure [7] validates the analysis so far carried out. In fact, it compares the results the analysis 
produces in terms of sequence {M} with exact values attained averaging 10 4 simulation samples, 
in the case N = 10 3 and r = 1. Again, the dash-dotted line represents the relative error multiplied 
by 10 3 , still very small. For comparison purposes we have also reported the curve in Fig. |4]that 
refers to Schoute's algorithm. We clearly see the advantage of AE 2 : the estimate A/i at first 
rises sharply reaching N with some overshoot, higher and sooner with respect to Schoute's case. 
Right after the estimate begins a steady decline with rate e _1 . 

What stated above is confirmed in Fig. [8]where we have reported sequences {A",} and = 
{Ri/Zi}. The former shows the convergence of the estimate in K = 1, while the latter reports 
the convergence of the real frame Ri to the virtual frame Zi. The protocol starts with r = z , 
e -Ki o anc i subsequently we have H ~ 2.39 as in Schoute's, which yields z\ = r\ — 2. 
Condition r< = z« is maintained up to i = 3 and then becomes z { > Bi decreases and when 
Ti Z{ we have Hi = H' ~ 1/(1 — 2e _1 ) ~ 3.78, and z i+ \ = H'zi, reducing the traffic more 
quickly than in Schoute's and speeding up the convergence phase, which is further reduced in 
time because the real frame is by far shorter. Bi reaches a minimum when Ki reaches one. At 
this point the real frame is so short that the collision solved are still very few. Beyond this point 
the protocol solves collisions with efficiency e _1 , Zi decreases and increases until condition 
Vi = Zi is reached again and never abandoned. From this point onward the backlog is solved 
exactly as in Schoute's algorithm. 

It is worth noting that the recursion in B does not get into its fixed point B = 0. In fact, once 
K = 1 is reached, by (O we can write B i+ i/Bi = R i+ i/Ri(l — Bie~ l )~ l > 1. 

Now we prove that the efficiency of AE 2 equals e _1 . The efficiency can be evaluated by 
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writing, as in Sec. Hill 



N 

V = ^ L'(N) + L"(N) + N"e > (17) 



where L'(N) is the average number of slots of the first phase in which there are no successes, 
L" (N) is the average number of slots of the second phase in which the estimate of the backlog 
converges to the actual backlog. As it has been already observed, in the first phase we have 
Hi = H', so that we have Zi = r(H')\ and assuming, as in the earlier analysis, that the first 
phase ends at frame u, where Z u = N/K u , solving the expression of Z u we get 

N 

u(N) = \og H ,—. (18) 
rK u 

The length of this phase can be bounded as 

6+2 



L\N) < |V < r ,+ v * = m + 1 * t+2 < ( l+log ^ 



(19) 



where last inequality follows by (fl"8l . therefore we have lirn/v-»oo L (N)/N = 0. The overhead 
of the second phase can be rewritten as 

L"(N) = (N - N")(e-e), < e < e, (20) 

where N" is the backlog size at the end of the second phase. At the end of the first phase we 
have 

_ A Ru K u u b K u (log H ,N -\og H ,rK u ) b 

H — 



Z u N N 
which implies hin^^oo B u = 0. In the second phase a few frames, v , are necessary to obtain 
K = 1, and we still have liniAr^oo B u+V = 0, which means that also the fraction of solved 
tags is asymptotically zero. Therefore, in (fTTl ) we have limjv-».oo N /N = 1, and by (1201) 
limAf^oo L"(N)/N = 0, so that (HTJ) yields i] = e _1 . 

V. Practical Issues 

A. Overall Optimization 

The analysis carried out in the previous section shows that AE 2 is asymptotically efficient 
whatever the values of parameters in (fl~3) are. Here we investigate the efficiency when N is 
finite, in the range [1, 10000]. Figure |9] shows the efficiency of the AE 2 procedure for three 
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different values of the parameter b, versus the number of tags N to be identified. We see that 
in all cases the convergence is assured, although with different performance. In fact, the best 
convergence is provided by b = 1, because it reduces the approaching phase overhead as predicted 
by (fT9l ). We see, however, that for values of N < 100, the performance appears to deteriorate in 
all cases, because of the small length of the observed frame r^, which increases the estimation 
variance. 

The above observation shows that laws (fT3l) and (fT2l) . though asymptotically optimal, are 
non-optimal with finite N. On the other side, those laws can be changed in a largely arbitrary 
way without affecting the asymptotic efficiency, as long as the former increases no more than 
polinomially, and the latter is an increasing function ultimately converging to (fT2l ). In the 
following we show a heuristic method to select those laws optimally, i.e., in such a way to 
provide nearly optimum performance for any N. 

In the sequel we refer to a mechanism that subdivides the procedure into two phases, namely 
the approaching phase, that is roughly composed of the approaching phase and the converging 
phase defined in Sec. [Till and the tracking phase. In the approaching phase we adopt, instead of 
(TT3T) and (fT2l) . laws optimally determined for this phase, and afterward we turn to (fTJb and (TT2T) . 
thus assuring the convergence of h and the asymptotic optimality. We can not precisely define 
when the approaching phase finishes, since n gets close to iV but can rarely match it exactly. 
However, as the estimation mechanism of AE 2 becomes effective when not all collisions are 
observed in the frame, we reasonably assume that the approaching phase finishes at the frame 
that shows the first non collided slot. 

In looking for optimal laws for the approaching phase, we are able to show that the two of 
them can be optimized separately. We assume at first that the estimate h (second law) is doubled 
at each frame. The performance indexes of the approaching phase are given by its overhead, 
that increases as the length of the real frame increases, and the accuracy of the approach of h 
to N, that can be measured by the traffic per slot N/zi = N/hi, at the end of the approaching 
phase. 

We now compare two cases, the first where all the frames of the approaching phase obey 
r, = 1, and the second when = 2. In both cases the estimate n is doubled at each frame, and 
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the approaching phase finishes when a non-collided slot is observed. Assuming that the traffic 
per slot is Poisson distributed, independently at each slot, an assumption that becomes more 
accurate as N increases beyond a few decades, we can easily find the a posteriori traffic s that 
maximizes the probability of observing a non collided slot preceded by a sequence of all collided 
slot. This traffic turns out to be si = 1.4 and s 2 = 1.85 for the two cases considered above. 
Simulations with N = 100 provides si ~ 1.5 and s 2 — 2 with standard deviations respectively 
equal to 0.9 and 1, while with N = 10 provides si ~ 1.4 and s 2 — 1.85 with standard deviations 
respectively equal to 0.6 and 0.7. These results clearly show that the approach of the first case 
is slightly better, i.e., closer to 1, than the second one. On the other side, increasing the frame 
length beyond one remarkably increases the overhead, so that the first case is definitely better 
than the second one. Based on this argument we may conclude that making 

n = l, t>0, (21) 

always provides close to best performance. 

In general, the estimate update can be represented by the law 

h l+ i = H%. (22) 

Repeating the above evaluations it appears that, as long as sequence {H^} is close to all twos, 
say in the range [1.5; 2.5], (|2TT) is still optimal. We now look for the optimal {HI} within the 
cited range. Here, we define the optimum sequence as the one that yields the highest among the 
minimum efficiencies observed in the whole range of N, so that the efficiency curve turns out 
as flat as possible. 

The problem simplifies in some ways. First, we observe that a sequence {H^} composed of 
all twos yields good performance above iV = 1000, while the minimum of the efficiency appears 
to be in the range N < 1000. In this range, doubling n at each frame, provides an approaching 
phase length that in the majority of cases lasts about 10 — 12 frames. This means that our 
optimum search can be limited to such a finite sequence. By further discretizing Hi in steps of 
0.1, we have been able to perform an exhaustive search that yields as optimal multipliers the 
sequence 

ff=[2 2 2 2 1.8 1.7 1.7 ...,1.7 ...]. (23) 
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We have prolonged the above sequence beyond i = 12 since this prolongation little, if at all, 
affects the results with high N. The corresponding efficiency curve is shown in Fig. [9] and 
positively compares with the curves seen before. Here, we can appreciate that the efficiency is 
always above 0.35, reaching e _1 asymptotically. 

B. Implementation issues 

As we have already stated at the beginning of Sec. [TV] real and virtual frames in AE 2 are 
compatible with the EPC standard set of commands. The procedure summarized in (fl"2l) . ([TTt . 
(|2TT) and (|23T) can easily be performed in the reader. However, in this standard, the frame size 
(virtual, in our case), conveyed by the reader to tags, must be of the type 2®, with Q in the range 
from one to 16. This constraint impairs the performance even if iV is known, since it prevents 
the optimal assignment r = n. 

Here we expose an approximate argument that is able to capture the asymptotic impairment 
due to the replacement of the optimal frame length n with the value closest to 2 Q , when the 
backlog n is known. 

Denoting by {rii} and {r-j} the sequence of backlogs and frame lengths, we assume that the 
random variable rii/ri, i.e., the average number of tags per slot in frame i, presents the same 
statistics regardless of the index i of the frame, which can be considered a good approximation 
for a large part of frames when N is very large. This leads to a frame efficiency rjf equal 
for all frames and, therefore, the identification period length L(n), starting with n tags, can 
then be written as L(ri) = J2i r i — n + n (l — Vf) + n (l — Vf) 2 + • • • = n/r]f, where, as 
usual when n is large, we have considered the random variables equal to their averages. This 
argument shows that the overall efficiency is rj — rjf. In order to determine rjf, we find the 
statistics of n»/ri, when the power index Qi of the frame length is assumed such that 2 Qi is the 
closest to rii. To the purpose, let assume that such index is Qi = q. This means that n — 
is in the range [2 q — 2 Q ~ 2 ,2 q + 2 9_1 ], being with equal probability higher or lower than 2 q . 
Similarly, s = n/r = n/2 q has a distribution that is bivariate, namely uniform within [3/4, 1] 
with probability 1/2, and uniform within [1,3/2] again with probability 1/2. The efficiency, or 
probability of a slot being successful, can be expressed as (1 — l/r)™^ 1 = (1 — s/n) n ^ 1 — >■ e~ s 
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and, averaged over the distribution of s already cited, provides rj — rjf = 0.3562. 

The above value of r\ almost perfectly matches the asymptotic values we have found by 
simulation, namely 0.357. Figure [10] shows the efficiencies of AE 2 and its constrained version. 
In the latter, sequence (1231) has been replaced by a sequence of all 2, whereas (fT2l) is replaced 
by the closest 2 Q . The results clearly show the impairment due to the constraint, but they also 
show that, also in the constrained version, our procedure is able to asymptotically reach the 
theoretical performance. 



In this paper we have presented an asymptotic analysis of Schoute's backlog estimate for 
DFA, applied to the RFID environment. The analysis shows that the asymptotic efficiency of 
this estimate is 0.311, far less than the theoretical maximum e~ l « 0.368, achieved when the 
backlog is known. The analysis shows that the performance loss is due to the slow convergence 
of the estimate and suggests how this impairment can be avoided. Using these results we have 
introduced the AE 2 , a procedure that asymptotically reaches the theoretical maximum, and that 
can be set to achieve an efficiency close to maximum for any finite value of the population 
size. We have also derived the asymptotic efficiency when the frame size is constrained to be 
a power of two, as required by RFID standards for DFA, with which the proposed protocol is 
fully compatible. Although the gain with existing mechanisms is moderate (in the 10 — 20% 
range), we remark that the proposed procedure is able to cope with any tag number, and does 
not present an hard limit on the maximum number of tags to be resolved. 



If rii and r { are both large, collided slots in frame i become distributed according to a binomial 
distribution with average rip c and variance rjp c (l —p c ), which is upper bounded by Therefore, 
being, in the approaching phase and for large r, r i+1 = Hci we have 



VI. Conclusions 



Appendix A 



VAR{r m Kr;} = H 2 VAR^n*, r*} < H 2 n. 



(24) 



Since the number of collisions can not be larger than N/2 it follows that 




(25) 
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for all i. Substituting (|25l) into (l24l) yields VAR{rj + i|n«, r«} < Nd, where d is a constant value. 
The Chebyshev's inequality used with the above bound yields 

Pr(|r i+ i -lZ i+1 \ > eN\ni,n) < 

that can be reduced to 

Pv(\r i+1 -TZ i+1 \>eN) < — . (26) 

Relation (1261) shows that, for N — > oo, we have r^/iV — > TZi/N, where the convergence is in 
probability. Much in the same manner one can show that rii/N — > Ni/N, and, therefore, we 
have rii/ri — >■ MijTZi in probability. 



Appendix B 

Here we consider sequence {IZi} during the first phase, where all the slots are collided, i.e., 
Ci = IZi and relation © becomes 1Z i+ i = E{Hci + £;} = HCi + Ej = + Hj, for i > 0. On 
the other side we have = HRi, for z > 0, with 1Z = R = r. Solving the recursions we 
get 

K^rW + J2 Hl ~ 1 ~ k ^k (27) 



fc=0 



Ri = rH\ (28) 

for i > 0. Relation (|27l) can be rewritten as IZi = Ri + Efc=o Since \E k \ < 0.5 < 1, 
and being El=o H k = (H l ~ — 1), we can write 



H % - 1 W - 1 

i2< - T7 — r < ^ < ^ + T? — r> « > °. 

.O — 1 .O — 1 



and 



l _ f(H) < Egogj < x , /(#) 



with /(if) = ~ 1 ))/(E^o i/i ) 5 having exploited ©. Since it is #< - 1< #\ for 

i > 0, we also have /(if) < 1, and finally lim P _ ) . oo (X)^ TZi)/(YlT=o R i) = l - 
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Table I 

Analytical values of the asymptotic efficiency of DFA with Schoute's estimate for different values of 

THE PARAMETER^. 





K u = 20 


K u = 25 


K u = 30 


K u = 35 


Ku = 40 


K u = 45 


K u = 47.8 


N/L(N) 


0.31125 


0.31127 


0.31125 


0.31122 


0.31122 


0.31123 


0.31125 
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Figure 1 . Efficiency versus the initial number of tags N of Schoute 's DFA mechanism for different values of the initial frame 
length r. 
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Figure 2. Efficiency of DFA with Bayes estimate and Poisson population- size distribution. 
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Figure 3. Efficiency of DFA with Bayes estimate and Uniform population-size distribution. 
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Figure 4. Average Schoute's backlog estimate A/i at the end of the frames versus time slot (N = 1000, r — 1). The dash-dotted 
line represents the relative error xlO 3 with respect to the analytical values N t . 
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Figure 5. Representation of the trajectory of the sequence {Ki}. 
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Figure 6. Efficiency of Schoute's backlog estimate versus initial traffic K. 
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Figure 7. Average backlog estimate TV; at the end of frames versus time slot, for the AE 2 algorithm (N = 1000, r = 1, 
b = 2). The dash-dotted line represents the relative error xlO 3 with respect to the analytical values iV». 
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Figure 9. Efficiency of AE 2 and of overall optimization algorithms versus the tag population size N for different values of 
parameter 6. 
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Figure 10. Efficiency of AE 2 and overall optimization algorithms versus the tag population size N for different values of 
parameter 6. 



